Rank in Wordlist | Frequency | Word |
---|---|---|
6739 | 2061 | 1,5 |
7058 | 1957 | 2,5 |
9880 | 1285 | 3,5 |
11144 | 1106 | 1,2 |
12973 | 903 | 4,5 |
14387 | 786 | 1,3 |
14439 | 782 | 0,1% |
14440 | 782 | 0,2% |
14637 | 769 | 6,5 |
14664 | 767 | 0,5% |
Rank in Wordlist | Frequency | Word |
---|---|---|
996029 | 1 | all'11,2%(-0,2% |
Rank in Wordlist | Frequency | Word |
---|---|---|
475987 | 2 | .) |
640817 | 1 | .-) |
Rank in Wordlist | Frequency | Word |
---|---|---|
2855 | 5488 | 100% |
2864 | 5463 | 50% |
3035 | 5119 | 10% |
3177 | 4861 | 20% |
3363 | 4546 | 30% |
4293 | 3471 | 40% |
4390 | 3380 | 15% |
4759 | 3072 | 5% |
5119 | 2843 | 90% |
5487 | 2608 | 2% |
Rank in Wordlist | Frequency | Word |
---|---|---|
13167 | 887 | S&P |
29447 | 289 | B&B |
31136 | 267 | l’S&P500 |
35859 | 215 | U&D |
42594 | 164 | M&A |
47289 | 139 | Dolce&Gabbana |
48800 | 132 | H&M |
50495 | 125 | b&b |
52790 | 116 | D&D |
55217 | 108 | R&D |
Rank in Wordlist | Frequency | Word |
---|---|---|
62899 | 87 | A$AP |
69339 | 74 | A$AP Rocky |
164106 | 17 | l’$&P |
203298 | 11 | A$ap |
323503 | 5 | Se$$o |
406870 | 3 | Ca$h |
599084 | 2 | fe$$i |
647819 | 1 | 100-$110 |
656190 | 1 | 150$/h |
660770 | 1 | 19$/mese |
Rank in Wordlist | Frequency | Word |
---|---|---|
4618 | 3181 | ." |
640816 | 1 | .-" |
Rank in Wordlist | Frequency | Word |
---|---|---|
326 | 40696 | c'è |
1689 | 9366 | all'interno |
1697 | 9345 | C'è |
2051 | 7674 | l'ex |
2389 | 6611 | un'altra |
2427 | 6493 | l'uomo |
2458 | 6421 | l'Italia |
2484 | 6346 | l'ha |
2490 | 6315 | dell'Ucraina |
2719 | 5754 | d'Italia |
Rank in Wordlist | Frequency | Word |
---|---|---|
31683 | 260 | Apple TV+ |
32516 | 250 | 5+1 |
79322 | 59 | Azione/+Europa |
108817 | 34 | 3+1 |
110699 | 33 | 2+2 |
110703 | 33 | 2020+21 |
120097 | 29 | Shonen Jump+ |
121678 | 28 | Black+Decker |
123983 | 27 | 4+1 |
124144 | 27 | Baldini+Castoldi |
Rank in Wordlist | Frequency | Word |
---|---|---|
148271 | 20 | Sagittarius A* |
232700 | 9 | Sgr A* |
Rank in Wordlist | Frequency | Word |
---|---|---|
5637 | 2541 | e/o |
6398 | 2198 | km/h |
6619 | 2109 | https://www |
12114 | 989 | 2022/2023 |
14093 | 808 | 2022/23 |
17516 | 604 | dimessi/guariti |
18868 | 547 | 2021/2022 |
19763 | 516 | euro/litro |
23666 | 398 | 2/3 |
23857 | 394 | qualità/prezzo |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing disallowed characters do not have a very low frequency, this may indicate a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
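The same check can be repeated for every character in the list. The following is a minimal sketch, not the tooling used for this report. It assumes a table `words(w_id, word, freq)` as implied by the query above, and it uses Python's built-in `sqlite3` module as a stand-in for the actual database. The helper names `like_pattern` and `words_containing` are hypothetical, and the `ORDER BY w_id` clause is an addition to keep the output deterministic. Note that `%` and `_` are wildcards in `LIKE` and must be escaped; SQLite has no default escape character, so the sketch declares one explicitly with `ESCAPE`.

```python
import sqlite3

# Characters checked in this subsection (taken from the list above).
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch: str) -> str:
    """Build a LIKE pattern matching any word that contains ch.

    '%' and '_' are LIKE wildcards and the backslash is the escape
    character declared in the query, so all three get escaped.
    """
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) rows for words containing ch.

    Mirrors the query above: rows with w_id <= 100 are skipped and
    the rank is reported as w_id - 100. ORDER BY w_id is an addition
    (assumption) so that the first `limit` rows are well defined.
    """
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


if __name__ == "__main__":
    # Toy in-memory table for illustration only; a real run would
    # connect to the corpus database instead.
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
    )
    conn.executemany(
        "INSERT INTO words VALUES (?, ?, ?)",
        [(2955, "100%", 5488), (13267, "S&P", 887), (5737, "e/o", 2541)],
    )
    for ch in SPECIAL_CHARS:
        print(repr(ch), words_containing(conn, ch))
```

Passing the pattern as a bound parameter, rather than pasting the character into the SQL text, also avoids having to quote `"` and `'` by hand.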
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots